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METHODS OF IDENTIFYING ORGANISM BASED ON ITS GENOTYPE 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The present invention relates to a method for identifying 
a species or homology of an organism such as microorganism by 
its genotype . 

2 . Description of the Related Art 

Organisms which include microorganisms have been 
identified so far basically using their phenotypes . However, 
identification by phenotypes was not suited for distinguishing 
organisms more accurately. Particularly, there was practical 
limitation upon identifying/ specifying microorganisms 
comprising a large number of species by phenotypes. Our daily 
life is surrounded by microorganisms. Microorganisms such as 
Escherichia coll 0157, Mycobacterium tuberculosis , 
methicillin-resistant Staphylococcus aureus (MRSA) , and ViJ^rio 
cholera cause many diseases, and a technique for accurately 
identif yingmicroorganisms is needed for establishing effective 
treatments and specifying infection pathways. It is also 
recognized that soil bacteria are involved in the productivity 
and quality of an agricultural product , and that the human health 
is greatly influenced by the enteric bacterial flora. Accurate 
studies have not carried out, however, concerning the relation 
between the productivity or quality of an agricultural product 
and the kind, quantity, and combination of soil bacteria, and 
the relation between the human health and the enteric bacterial 
flora . This is due to the fact that it is impossible to accurately 
identify/distinguish microorganism ( s ) by the conventional 
identification by its/their phenotype ( s ) . 

In the above-mentioned situation, identification of 
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microorganisms by genotypes is proposed instead of that of 
phenotypes . 

It is quite possible to identify/distinguish 
microorganism (s) by comparing the (whole) genome of each 
microorganism using the recent sequencing technique level. It 
needs, however, a significant amount of work and time, and is 
not a simple method. Therefore, identifying/distinguishing 
species of microorganisms by comparing the (whole) genome is 
practically not a widely applicable method. Although a method 
for comparing a part of a genome is known as a more simple method, 
enough information is not obtained for 

identifying/distinguishing species by comparing the 16S rRNA 
sequences . 

SUMMARY OF THE INVENTION 

It is therefore the object of the present invention to 
provide a method for identifying the species, homology, and 
so on of organisms such as microorganisms, which is simple to 
some extent and practically workable , and is based on genotypes . 

The present invention is directed to a method for 
identifying an organism comprising the steps of: 

1) preparing one kind or more of double-stranded DNA fragments 
by the random PGR using, as a template, at least a part of a 
genome of an organism which is to be identified, 

2) subjecting the double-stranded DNA fragments prepared in 
step 1 to temperature gradient gel electrophoresis (TGGE) or 
denaturant gradient gel electrophoresis (DGGE) , 

3) extracting identification dots of each DNA fragment from 
an electrophoretic pattern which was obtained in step 2, 

4) determining PaSS and/or genome distance (s) from the 
identification dots which were obtained in step 4, and 

5) analyzing the PaSS and/or genome semi-distance (s) which 
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was/were obtained in step 4, 

wherein in the electrophoresis by TGGE or DGGE, a standard DNA 
is co-existed as a standard point for the identification dots 
and the pseudo-absolute location of the identification dots 
is determined from the locational relation to the standard DNA. 

BRIEF DESCRIPTION OF THE DRAWINGS 
The above and other objects, aspects, features and 
advantages of the present invention will become more apparent 
from the following detailed description when taken in 
conjunction with the accompanying drawings, in which: 

Fig.l illustrates examples of genome profiling images 
(an electrophoresis pattern which is obtainedby TGGE of standard 
DNAs which are shown by Seq. ID Nos. 1 and 2) ; 

Fig. 2 illustrates the first typical band curve; 
Fig. 3 illustrates the second typical band curve; 
Fig. 4 illustrates eight featuring points Pj (j = 1-8) which 
were extracted from three band curves; 

Fig. 5 illustrates 'n' featuring points Pn (i = 1-n) with 
respect to a specimen microorganism which corresponds to 'n' 
featuring points Poi ( i - 1-n) with respect to a reference genome ; 
and 

Fig. 6a illustrates an electrophoresis pattern in case 
cy3-pfM12 (5' cyS-AGA ACG CGC CTG 3') which was obtained in 
Example 2 was used as a primer, and Fig. 6b illustrates an 
electrophoresis pattern in case FITC/UCS (5' FITC- CA GGA AAC 
AGC TAT GAC 3') was used as a primer. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 
(1) Preparation of double-stranded DNA fragments from genome 
by random PGR 



s c 



f 
( 



5 



C3 



The random PGR is a method which was developed by the 
inventors . 

The normal polymerase chain reaction (PGR) method is a 
method for amplifying a specific DNA region by repeating DNA 
synthesis using two kinds of primer which sandwich a specific 
DNA region and DNA polymerase. Specifically, 1) a 
double-stranded DNA is denatured by heat treatment at 90 °G or 
so to give single-stranded DNAs (denaturing) ,2) a primer 
attaches to the obtained single-stranded DNA by annealing at 
50**G or so, and 3) a DNA chain is synthesized by DNA polymerase 
using the primer as the starting point using the single-stranded. 
DNA as a template using monomer nucleotides as raw materials 
at GS'^C or so (elongation of strand) . Only a double-stranded 
DNA fragment is amplified which corresponds to a sequence in 
a specific DNA region which was sandwiched between parts which 
have sequences which are complementary to the two kinds of primer 
by repeating a cycle comprising the denaturing, the annealing, 
and the elongation. 

The above-mentioned PGR method is a method for amplifying 
only a DNA fragment in a specific DNA region. Therefore, the 
condition for the above-mentioned annealing is designed so that 
the primer can bind to a specific part of a single-stranded 
DNA which has a sequence which is complementary to the sequence 
of the primer (e.g. , a pair of G and G or a pair of A and T) . 
Namely, the condition for the annealing is designed so that 
a primer and a single-stranded DNA can form a double strand 
without causing a mismatch between the primers and the 
single-stranded DNA as a template. DNA strand can be amplified 
even by carrying out the above-mentioned annealing at about 
30 °G and carrying out the elongation of strand at about 50 "G. 
However, the temperature for the annealing is low, so. that 
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mismatch (s) (i.e. , not complementary; e.g. , a pair comprising 
G and A) can be contained in a part of the base pairs between 
a primer and a single-stranded DNA as a template. Normally, 
only a DNA fragment which has a absolutely complementary sequence 
can be a primer and allows amplifying the strand. On the contrary, 
if the annealing and the elongation are carried out at a 
relatively low temperature as described above, even a DNA 
fragment which does not have a completely complementary sequence 
can be a primer and allows amplifying the stand. 

Therefore, if a cycle comprising denaturing, annealing, 
and elongation (wherein annealing and elongation are carried 
out at a lower temperature than the normal PGR) is repeated 
using DNA such as genome DNA as a template using a DNA fragment 
which has a specific sequence (e.g. , a fragment which is 12mer 
or so , and is selected independent of a DNA sequence of a template) 
as a primer , even a DNA fragment which does not have a completely 
complementary sequence can be a primer, and the strand is 
amplified, so that one kind or more of double-stranded DNA 
fragment which can contain mismatch (es) in a primer part is/ are 
formed. The purpose of this method is not to amplify a specific 
region on a template, and a strand is amplified if it can be 
a starting point for the elongation even if mismatches are 
contained to some extent, so that two kinds or more of 
double-stranded DNA fragment can be formed. In addition, it 
is found that if the same DNA which is used as the template 
and the same DNA fragment which is used as a primer and the 
same cycle conditions for denaturing and annealing are used, 
DNA strand (s) which has/have the same sequence ( s ) is/are always 
amplified. This method is called ^Random PGR' because the purpose 
of the method is not to amplify a specific region on a template. 

By the random PGR, various DNA strands can be obtained 



in amplification by using a primer DNA fragment different in 
a sequence even if the same genome DNA is used as a template, 
or by changing the condition for the cycle of denaturing, 
annealing, and elongation. Especially obtained DNA strand (s) 
is /are largely changed by both length and sequence of the primer , 
so that it is preferable to choose an appropriate primer. 
Versatile sequence information of a genome can be taken out 
by changing a nucleotide sequence of a primer taking advantage 
of the character. 

A method according to the present invention allows 
amplifying and preparing one or more kind (s) of double-stranded 
DNA fragment by the random PGR using as a template at least 
a part of a genome of a microorganism whose species and other 
properties are to be identified. At least a part of a genome 
which is used as a template in the random PGR can be prepared 
from a microorganism according to generally used methods. At 
least a part of a genome which is used in the random PGR as 
a template can be prepared, for example, by the alkali method 
(H . Wang, M. Gin, A.J. Gutler , Nucl . Acids Res .,21, 4153 (1993)). 

The random PGR in which an obtained genome is used as 
a template can be carried out as follows: 

Apart from the normal PGR method, an appropriate sequence , 
length and kind of a primer can be appropriately selected for 
the random PGR. Experimentally and analytically the most simple 
system, however, is the case where a relatively short single 
primer is used. Even in the case a single primer is used, and 
agenome which is derivedf rom the same microorganism is amplified, 
a kind and quantity of a double-stranded DNA fragment which 
can be amplified depend on the sequence and length of the primer . 
In addition even in the case a single primer and a primer which 
has the same sequence and length are used, a kind and quantity 



of a double-stranded DNA fragment which can be amplified depend 
on the microorganism from which the genome is derived." 
Considering these facts, sequence, length, and kind of a primer 
are appropriately selected . Too longprimer can have the hair-pin 
structure, can prevent annealing, and tends to lower a yield 
of a PGR product . Too short primer does not adequately stabilize 
a bonding containing mismatch (es) , and tends to lower a yield 
of a PGR product. A length (base length) of a primer is suitably , 
for example, 8-20, preferably 10-16, more preferably 10-12, 
and most preferably 12. An oligonucleotide which has a length 
of 12 bases can be used as a primer . Such oligonucleotide primers 
include pfM4 (dCAGTCAGGAGGT) , pfM12 (dAGAAGGGGGCTG) , pfM14 
(dCGTCGGTATTAA) , pfM19 (dCAGGGCGCGTAG) , (dA) 12 ( d AAAAAAAAAAAA ) , 
(dAaTa) 2 (dAAATTTAAATTT) , (dAATT) 3 (dAATTAATTAATT) , (dACG) 4 



(dACGACGACGAGG) 
(dCCCCCCCCCCCC) 
(dGGGGGGGGGGGG) 
(dGGCCGGCGGGCC) 
(dTTTTTTTTTTTT) 
(dTGCTGGTGCTGC) 
( dGGGCGGCGACCT ) 
(dGGGGTCGAGGGG) 



(dAT)6 
(dCGGG) 3 
(dGA) 6 

(dCT) 6 

(dT3G3) 2 
(TA) 4C2AC 
Cohesive2 



( dATATATATATAT ) , 
(dCCGGCGGGCCGG) 
(dGAGAGAGAGAGA) , 
(dCTCTCTCTCTCT) 
(dTTTGGGTTTGGG) , 
(dTATATATACCAC) , 
(dAGGTCGCCGCCC) 



(dC) 12 
(dG) 12 
(dGGCC) 3 
(dT)i2 
(dTGC) 4 
Cohesivel 
G4sand 



GCTA9 (dGCTAAAAAAAAA) , notG (dCAATTCTACAAC) , 
notT (dACGAGCGAGCGC) , Promotel (dTATAATTATAAT) , Promote2 
(dATTATAATTATA) , SDl (dGATCACCTCCTTA) , SD2 ( dTAAGGAGGTGATC ) , 
Telomerel (dCCCACCCACCCA) , Telomere2 (dTGGGTGGGTGGG) , 
FITC17-H-3' (5' GAGGAAACAGCTATGAGATCTTCTC3') , FITC17-H-5' 
(5' GAG GAA ACA GCT ATG ACG TTC TCA C 3') , LH-7-3 ' (5' GGC GAT 
ATC GCT GAA A3'), LH-7-5 ' (5' TAT TAT TTC CGC AAA G 3') , M13 
Reverse (5' CAG GAA ACA GCT ATG AC 3') , cy3-pfM12 (5' cy3-AGA 
ACG CGC CTG 3') (with cy3 fluorescence), FITC/UCS (5' FITC- 
CA GGA AAC AGC TAT GAG 3') (with FITC fluorescence) , MAl-FITC 
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FITC- TGC TAC GTC TCT TCC GAT GCT GTC TTT CGC T 3 ' ) (»ith 
fluorescence, , cyS-MAl <5,. cv3- TGC TAG GTC TGT TGC GAT 
OCT GTC TTT CGC T 3 ' , (with =y3 fluorescence, , HEX-pfMll ,5 
HKX- GAA CCT CCC GAC 3 ■ , (with He. fluorescence, , TAM-TGC4 <5. 
TAM- TGC TGC TGC IGC 3 ' , (with Tamara fluorescence, , and the 
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in the random PGR, as described al>cve , PCR is operated 
at a re^arKably lower temperature than the normal PCR while 
other operational conditions for the random PCR are 
substantially the same as the normal PCR. Reaction mixture and 
reaction condition for the random PCR are as follows: 100 . 
1 of reaction mixture contains 200 «M dNTP (N - G, A, T, C, , 
0 S .M primer, 10 m« Tris-HCl ,pH 9 . 0, , 50 mM KCl . 2.5m«MgCU, 
0 1% Triton X-100, 0.02 unit/ f. 1 Taq DNA polymerase, and an 

. r, 3 ul Of a DNA solution which was 
appropriate amount (e.g., J n r oi 

prepared in a way which is described elsewhere rs added an 
the total volume of the reaction mixture was adjusted to 100 
, 1, Of template 0«A. PCR comprises 20-30 cycles of 
steps: treatment at 94-C for 1 min, denaturatron at 94 C or 

nr^rr ;,t 28°C for 2 min, and elongation at 4/ u 
30 min, annealing at ^ 

4- ->■»- AIT fnr 2 min. When prMl4 
for 2 min, followed by treatment at 47 C for ^ m 

, rj-r ,-ontent is used as a primer, annealing can 
which has a low G+C content is u» 

be carried out at 23-C, elongation can be carried out at 
for 10 mm, and the final chase treatment can be carried out 
at 42 -C for 10 min . For the experiment which was described above , 
it is preferable to carry cut preparation and mixing of the 
reaction mixture on a clean bench, and to irradiate UV light 
,312 nm for 10 min, to a solution (which contains all except 
template DNA, immediately before adding template DNA to prevent 
formation of PCR products which are derived from impurities . 

For the above-mentioned random PCR, DBA labeled with a 
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fluorescent marker can be amplified by using a raw material 
with a fluorescent marker. In this case, using a fluorescence 
labeled DNA as a standard DNA, identification dots (as described 
below) can be extracted by means of the image processing using 
fluorescence markers carried by the DNAs . A primer or nucleotide 
can be the raw material labeled with a fluorescent marker. A 
primer or nucleotide labeled with a fluorescent marker can be 
purchased or easily synthesized according to well known methods . 
(2) Electrophoresis of double-stranded DNA fragment by TGGE 
or DGGE 

According to the present invention, the double-stranded 
DNA fragments obtained in (1) are subjected to temperature 
gradient gel electrophoresis (TGGE) or denaturant gradient gel 
electrophoresis (DGGE) . 

Both TGGE and DGGE are well knownmethods . TGGE is described 
in detail, for example, in R. Riesner, et al . , Electrophoresis 
10, 377-389 (1989) . DGGE is described in detail, for example, 
in E. S. Abrams , V. P. Stanton, Jr., Methods Enzymol . , 212, 
71 (1992) . Bothmethods canbe carried out easily using commercial 
apparatus. Both TGGE and DGGE are two-dimensional 
electrophoresis using a slab-type gel providing a temperature 
gradient or denaturant concentration gradient along and 
vertically to the electrophoresis direction of the 
double-stranded DNA. TGGE and/or DGGE allows understanding the 
behavior of melting (dissociation of two strands) which is 
characteristic to the nucleotide seguence of the 
double-stranded DNA. Namely, the strength (heat resistance or 
denaturant resistance) of bond of base pairs in a region which 
exists in the double-stranded DNA to be subjected to 
electrophoresis can be visualized by TGGE or DGGE. 

For TGGE, a temperature gradient from 30 °C to 70**C is 



...d For DGGE, a denaturant such as urea and 
normally applxed. For DOC. , ^ ^.^dient for example, 

f ormamide can be used with a concentr 

,3...o a Lo..« po=...o. an ..en....ca..on ^ ^ ^^^^^ 

^ . ^^^^tAon to standard DNA which is addea 

loca..cnaX «Xa.xon ^^^^^^^^^ 

which was ampl.£«d ^ identification dot, and 

.o-e.isted as .L^.ho.esis, whe.e.. the 

then the mixture is sub] ectea c:orresponding 

--------;:r::r;o:::::rz:.on - 

,o its sequenoe. ^"^^ J" be determined 

identification dot of each double st.a 

fro. the locatlonal relation to the elec P 

- --"^ „::rr:":: anai^m, 

...naara spec..en^ U^. ......cphoresis and have 

deviation to some extent ^^^^^^ J ^^^^^ correcting 

------'^■r: lal r ectrop.oresis pattern and 

data which are obtained 

^°^^"::^™phoresis bv .00. or 0.0. at least a part o. 
doublelranded ... has the -^^-^'----^r, r 

- . dot Where a ril.ration speed 
SS LTs n.le-stranded ... as .eaturin. 
" ^°^^;r3e ;eaturU points cannot b^ 

points . These featuring preferable that standard 

condition for electrophoresis. It is pr 

1., observing featuring points sucn 
DNA allows clearly observi g 

^r,+- the slowest point, ana 
melting-starting pomt, the 
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„„.U.t.-conve.sin, point un.e. an adop.ea - 
.onai.ion fo. .ivin, a standard point. In add.t. ^ 
preferable that the standard Dm gives a pattern wh.ch can he 

. . from that of double-stranded DUK which 

easily distinguished from that 

1. derived fro. the genome «hich is to be ^^-"^"^^ 

Standard 0» should be a set of DNA (double-stranded D«A, 
hair pin-type DNA/R«A. or the li.e, which gives clear transition 
ne r each Of both ends, for example, within a predetermined 
t e gradient (or concentration gradient, , and be one 
temperature 9"^-"^ ' . transition range of 200-1,000 

which is not overlapped m the -craua 

base long for normally observing the transition. Two Kinds or 
% more of Itandard OK. can be used for providing more accurate 
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standard point (s) . cr.r„n^nre 

«ot only a sequence shown by SEQ ID «o . 1 or 2 - 
.isting but also other se^ences can be used for ---- -- 

Electrophoresis patterns which were obtained by TGGE with 
standard DNAs of SEQ ID «os . 1 and 2 are illustrated in Fig 1, 

■ in Table 1. The condition 
and their properties are summarized m Table 

andthei p P . tcge was as follows : an electrophoresis 

for the above-mentioned TGGE was as tox 

apparatus (Taitec TG-180, manufactured by Taitec, Japan) , 
p! l ac lali^^ (BMurea, .0 mM Tris buf f er .pHB . 0> ) linear 

temperature gradient from 30»C to VO = C, 300V, and 90 mm. 
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Table 1 



Restored 
temperature 



Ti CC) 



T^ll^l 



SEQ No. 1 



60 



67 



SEQ No. 2 



70 



72 



of species identification dot from 
(3) Extraction of species 

electrophoresis pattern , 

slnds (electrophoresis pattern, of DK. (including 
standard 0«., on an electrophoresis gel which were obtained 
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by TGGE or DGGE are visualized, for example, by silver staining, 
and identification dots are extracted from each visualized 
pattern . 

Silver staining of the gel can be carried out , for example , 
according to Boulikas and Hancock (T. Boulikas, R. Hancock, 
J. Biochem. Biophys. Methods, 5, 219 (1981)) with an improvement 
using PEG treatment according to Ohsawa and Ebata (K. Ohsawa, 
N. Ebata, Anal. Biochem., 135, 409 (1983)). 

The method will be outlined below: 

1) Gel which is attached to a gel bond film is transferred to 
aplastic container which contains 200ml of a 30% PEG2000 aqueous 
solution , and the obtainedmixture is stirred at room temperature 
(15-30OC) for 30 min . Meanwhile the gel is separated from the 
film, is shrunk, and becomes white and translucent. 

2) The solution is once substituted for 150 ml of distilled 
water, and the gel is rinsed adequately. This procedure is 
repeated twice. The rinsed liquid is completely sucked with 
an aspirator. 

3) After the rinsed liquid is removed, about 200 ml of silver 
stain solution (10 ml of IM NaOH and 2 ml of 25% ammonia are 
added to 200 ml of bidistilled water, and mixed, and 0.4 g of 
silver nitrate is added to the mixture) is poured, and the 
resultant mixture is stirred. 

4) The stain solution is substituted for 200 ml of bidistilled 
water, and the water is stirred for 1 min, and the water is 
discarded, and this procedure is repeated again. 

5) The liquid is substituted for about 200 ml of a developer 
(which was prepared by adding 200 1 of 10% citric acid and 
200 ;il of formalin to 200 ml of bidistilled water, followed 
by mixing) , and the gel is gently shaken until the bands are 
stained so as to have appropriate density. 
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6, «hen the bands are stained so as to have appropriate densrty, 
the developer is quioKly removed, and 200 ml o£ a stop solution 
which was previously prepared ,10% acetic acid . 40% methanol, 
in water, is added, and the gel and the licuid are sha.en for 
about 10 min to give a stained gel specimen (the waste developer 
Should not be dried because explosive fulminating silver can 

be formed) . . 

/ ^ 4= i -I- i <=; 21 na and 300 base 
several ng of DNA fragments (if it is 21 ng 

long it is about 10 fmol, i.e., 6x10^ molecules) which exist 
in 100 M 1 of PGR solution can be easily detected by the treatment 
above (even one-tenth of that can be detected under a good 
condition) . 

in case an enough amount of DNA is available (several 
tens ng/band) , EtBr (ethidium bromide) method can be applied 
i e a gel after electrophoresis is washed with water , soalced 
in a's Mg/-1 EtBr solution (for about 10 min) , and directly 

observed under a UV lamp (at about 360 nm) . 

Alternately, species identificationdotscanbeextracted 

by means of a fluorescent marker carried by DNA, i . e . , by using 
a raw material labeled with a fluorescent marKer for random 

1 ■ ^ , nMA labeled with the fluorescent marker m the 
PGR to amplify DNA laoeiea wxi-i 

^^r,. vr-Q of <;teD 1- The raw material 
amplification process by random PGR of step 

with a fluorescent marker can be a primer or nucleotide whrch 
becomes a substrate for polymerase. Both a primer labeled 
with a fluorescent marker and a polynucleotide labeled wrth 

n 1 ^r.^ ainH ;=»r-^ easily commercially 
a fluorescent marker are well-known , and are easily 

aval lable. 

using a raw material for random PCR, for example, a prrmer 
labeled with a fluorescent marker, subjecting to 
electrophoresis D«« which were separately amplified usrng 
fluorescent markers being different in excitation and 
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fluorescence wavelengths on one plate, and using an excitation 
wavelength and a fluorescence wavelength which have different 
patterns for each DNA allows separately detecting each of the 
DNAs . Specifically, the obtained amplified DNA can be detected 
using primers A, B, and C for amplifying DNAs of organisms a, 
jb , and c, respectively, which belong to different species one 
another, wherein primers A, B, and C have the same sequence, 
but have different fluorescent markers, using an excitation 
wavelength and a fluorescence wavelength corresponding to the 
fluorescent marker of primer A with respect to organism a ; using 
Q an excitation wavelength and a fluorescence wavelength 

corresponding to the fluorescent marker of primer B with respect 
to organism b; and using an excitation wavelength and a 
fluorescence wavelength corresponding to the fluorescent marker 
of primer B with respect to organism c allow detection of each 
DNA. Alternately, amplification of DNAs of an organism is made 
P by using primers A, B, and C which are different in the sequence 
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and fluorescent marker and the obtained amplified DNAs can be 
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Q detected using an excitation wavelength and a fluorescence 



wavelength corresponding to the fluorescent marker of primer 
A; using an excitation wavelength and a fluorescence wavelength 
corresponding to the fluorescent marker of primer B; and using 
an excitation wavelength and a fluorescence wavelength 
corresponding to the fluorescent marker of primer C. 

Herein 'identification dots' refers to inflection dot, 
isomobility dot, and the like which each pattern (curve) has. 

Identification dots (e.g. , inflection dot and isomobility 
dot) can be chosen, for example, in the following way: 
1) A dot where melting begins (Ti) is determined as the maximal 
value of second derivative or the middle dot between two dots 
which give second derivatives of 0 . 
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2) A discontinuous dot (dot which gives a large change in Y 
for a slight increase in X) is determined to give Tmin (minimal 
mobility dot) . 

3) Dot Tc where a DNA arrives at a mobility of a single-stranded 
state (dot where a DNA first arrives at the final mobility) 
is determined. 

The above-mentioned operation comprising steps 1-3 is 
carried out from a clear band by turns to give, for example, 
10-12 dots as a whole although the number of band can be 
appropriately chosen. If the species is already known and the 
S band which corresponds to the species is known, the 

above-mentioned operation can be carried out with the previously 



specified band(s) . 

Specifically, identification dot(s) can be extracted by 
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the image processing using a CCD camera and a computer. For 
the image processing, an image of an electrophoresis pattern 
which was visualized by the above-mentioned silver staining 
is imported via an appropriate means for taking picture into 
Q an appropriate electric recording medium as a color image or 

^ a gray-scale image . Specifically , an image of an electrophoresis 

pattern can be imported into a computer as a genome profiling 
image using a means for taking picture such as digital camera. 
Fig.l illustrates an example of genome profiling image of a 
sequence for inner standard as described below. 

Genome profiling images are normalized on a computer, 
if necessary, after distortion is corrected. Correcting 
distortion can be omitted if the distortion of a genome profiling 
image is within a normal error range. Normalization is the image 
processing in which ordinate (y-axis; mobility) and abscissa 
(X-axis; temperature or concentration) directions are 
normalized based on species identification dots of an inner 
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standard reference specimen , experimental condition parameters , 
and so on. This normalization makes mobility M dimensionless 
so as to have a value from 0 to 1 . For temperature T, effective 
temperature which is obtained by converting a denaturation 
efficiency of urea which is present in a gel into temperature, 
i.e.^ restored temperature is applied. 

After normalization, lines are extracted, and the 
extracted lines are f unctionalized . Lines (bold lines) are 
extracted by so-called ream-forming reference dot shift method 
with respect to each of some bands in genome profiling images. 
In case a band of interest is continuous before and after 
denaturation, a bold line can be extracted by specifying a dot 
on the band. In case a band of interest is discontinuous before 
and after denaturation, however, a bold line is extracted by 
specifying two dots before and after denaturation. A bold line 
can also be extracted using the binarization method instead 
of the ream- forming reference dot shift method. 

A bold line corresponding to each band which was extracted 
from a genome profiling image is then slimmed. Slimming is a 
process in which a bold line is converted into a slimmed line 
which is a single-valued function. In slimming, a slimmed line 
is obtained which consists of smooth chain (curve) which consists 
of many centers of gravity by the smoothing process in which 
an average coordinate of y-coordinate of an gravity and 
y-coordinate of two adjacent gravities is replaced for the 
r-coordinate of the gravity , for example , by the running average 
method . 

The slimmed line is f unctionalized . In the 
functionalization , each slimmed line is approximated to an 
n-order function which has an appropriate order (e.g., 
ten-order) or trigonometric function using, for example, the 
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Newton-Rapson method. 

Featuring points were extracted for slimed lines and for 
each band curve which has been f unctionalized . Extracted 
featuring points include melting-initiation point Pint , minimal 
mobility point Pmin, and mobility end point Fend- 
Melting-initiation point is a point on a band curve which 
corresponds to the initiation of melting before denaturation . 
Minimal mobility point Pmin is a point on a band curve at which 
the mobility reaches the minimum. Mobility end point Pend is 
a point on a band curve at which a DNA reaches the mobility 
in a single-stranded state after denaturation. 

Fig. 2 illustrates a mode of the first typical band curve. 
As shown in Fig. 2 , in typical first band curve 51 , mobility 
M increases nearly linearly with temperature T at lower than 
the temperature (Tmi) at which the melting begins. The 
temperature reaches the temperature (Tini) which corresponds 
to melting initiation point Pmi, melting began, and mobility 
M decreases nearly linearly with temperature T, and the 
decreasing rate is slowly decreased, and a DNA reaches minimal 

mobility point P min • 

After the temperature reaches the temperature (Tmin) 
which corresponds to minimal mobility point Pmin, mobility M 
rapidly increases with the increase in the temperature, and 
a DNA reaches the mobility in a single-stranded state after 
denaturation, i.e. , mobility end Pend- At a temperature higher 
than the temperature (Tend) which corresponds to mobility end 
point Pend, mobility M increases nearly linearly with temperature 
Tagain. It is considered to be due to the decrease in the viscosity 
accompanied with the increase in the temperature, but 
substantially not due to the change in the shape . It is frequently 
observed that minimal mobility point Pmm becomes a discontinuous 
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point, and a band curve becomes discontinuous between the 
temperature which corresponds to minimal mobility pent 

P.,„ and a temperature which is s 1 ightly lower than the temperature 
(Tend) which corresponds to mobility end point ^Pen.- 

Fig . 3 illustrates a shape of the second typical band curve . 
AS shown in Fig. 3, in typical second band curve 61, as 
similarly as in the case of first band curve 51, mobility M 
increases nearly linearly with temperature T at a temperature 
which is lower than the temperature (T.„.) atwhichmeltingbegins , 
and the temperature reaches the temperature (T.ni) whxch 
corresponds to melting initiation point P.ni , andmelting begins , 
and mobility M decreases nearly linearly with temperature T. 
Apart from the case of first band curve 51, however, after a 
DNA reaches minimal mobility point P..n, mobility M increases 
nearly linearly with temperature T again at a temperature wh.ch 
is higher than the temperature which corresponds to minimal 

mobility point P..n. Namely, in second band curve 61, minimal 
mobility point P..„ and mobility end point Pen. conform, so that 
a band curve does not become discontinuous. 
^ Based on the information such as the first differentiation , 

the second differentiation, and its discontinuous points of 
band curves which were obtained via functionalizatxon , 
featuring points such as melting initiation point P.ni, minimal 
mobility point P..n, and mobility end Pe„a are extracted. 

specifically, melting initiation point P.ni is extracted 
as the middle point of a curve part which is surrounded by two 
straight line parts 52 (62) and53 (63) . Hence , melting initiation 
point P.n. is determined based on the first differentiation value 
and the second differentiation value of band curve 51 (61) . 

Minimal mobility point P.i„ is extracted, for example, 
as discontinuous points on a curve in the case of first band 
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V- ^ ^r, ■i-h*^ case of second band curve 
curve 51. On the other hand, m the case or 

■ ^ n -i = Hp-t-prmined as the same point 
61, minimal mobility point P„i„ is determmea 

as'mobility end point Pen. as described later. 

in addition, mobility end point Pen. is determined as 

• -u-K 1 ^ r^o ■n;=irt 54 (64) as observed 
dot which departs from the straight line part 54(6) 

alon, direction based on the first differentiation value 
and the second differentiation value of band curve 51 .61, . 

f^;=^riirina points extraction is 
The above-mentioned featurmy 

carried out one by one fro. a clear band, i.e., a clearly 

subdividedband curve to give a predeterminednu^nber of featuring 

» in case the species of a specimen microorganism 

points as a whole, incase 1-11*:= ^ 

1. already Known, and specific bands whose featuring points 
Should be extracted are evident, featuring pints of these 
specif ic bands are extracted. Eig . 4 illustrates eight featuring 
points (J = l-8> Which were extracted from three band curves^ 
actually, eight featuring points or more are extracted from 

-three band curves or more. 

Mthcugh f eaturingpoints are extracted f romband curves which 
„„e Obtained via aliening and functionali.ation in the 
above-mentioned description, featuring points can also be 
extracted directly from each bold line without slimming or 
f unctionalization . 

f o=<;<; and/or genome semi-distance from 
(4) Determination of PaSS ano/or y= 

identification dots (temperature and mobility) 

.pass- stands for Pattern Similarity score. 'Genome 
semi-distance' is an index which expresses a similarity of 
genomes of two microorganisms using PaSS, and is expressed as 
(l-PaSS,/PaSS. pass and genome semi-distance are determined 
from featuring points (spiddos) which were obtained by the 
above-mentioned extraction as described below . ^ microorganism 
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can be identified by mean, of genotype using PaSS and/or genome 
semi-distance. A microorganism is identified by comparing 
featuring points which were obtained with a genome of a 
microorganism of interest with those of reference genome. 

in case the species of a microorganism of interest is 
already known, the species is used as the reference. 

in case the species of a microorganism of interest rs 
unknown, representative species (e.g.. several tens species, 
Which were previously chosen based on the overall shape of the 
genome profiling image are used as tentative references . 

in case the species of a specimenmicroorganism is unknown , 
species are successively listed up which have similar featuring 
points with respect to each featuring point which was obtained 

^- f^o;:>-hnrina point extraction {e.g., the 

by the above-mentioned featuring puxi 

difference in distance from starting point O to each featuring 
point is 5% or less in Fig. 7, , and the species which were listed 
« up are assumed to be tentative references. 

Then, based on featuring points of reference genomes, 
corresponding featuring points ,e . g ., ten dots, are determined 
from the featuring points which were obtained by the 
above-mentioned featuring points extraction, where.n 
-corresponding featuring points' means featuring points wh.ch 
locally correspond to each of featuring points of reference 
genomes. Corresponding featuring points can be determined, for 
example, by 1, 'the operator specif ioation method' in wh.ch 
an operator manually specifies corresponding featuring pornts 

^ ^rA^.=r^^^^ 2) 'the automatic assignment 
one by one on a computer display, ^) 

method- in which a coordinate zone (a region which occupies 
a two-dimensional specific area in XY coordinate, is determined 
to which each of featuring points of reference genomes belongs , 
and specimen featuring points which belong to the coordinate 
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zone are successively and automatically assigned as 
corresponding featuring points , 3 ) ' the optimally corresponding 
automatic calculation method'' in which the number of 
corresponding featuring points are appropriately set, and 
featuring points are arbitrarily chosen from featuring points 
of reference genomes and extracted specimen featuring points 
so that the number of the featuring points can be each of the 
numbers of corresponding featuring points, and they are 
associated according to the combination theory, and 
corresponding featuring points are automatically determined 
based on the combination which gives the largest PaSS which 
is to be defined later. 

In this way, as shown in Fig. 5, 'n' featuring points Pn 
( i = 1-n) of a specimen microorganism are obtained corresponding 
to ^n' featuring points Poi (i = 1-n) of a reference genome, 
wherein each featuring point Pi (^n' featuring points poi and 
^n' featuring points of a specimen microorganism) has position 
vector Vi (position vector Voi of ^n' featuring points of a 
reference genome and position vector Vn of ^n' featuring points 
of a specimen microorganism) . Point O at which T = the lowest 
temperature at M = 0 can be used as the starting point of position 
vector Vi . 

Using position vector Vi of each featuring point Pi, PaSS 
is obtained which is defined by Eq.l below: 

Pass = 1 - { 2 y (i) }/n (1) 
where 'Z^ denotes the sum of i = 1-n, andy (i) is expressed 
by Eq.2 below: 

y (i) = 2 X I Vii-Voil / ( I Viil + 1 Voil ) (2) 

Although a value of PaSS is determined according to the 
vector (dependent) type calculation method in the 
above-mentioned description, the value can also be determined 
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based on the scalar type calculation method as describe below: 
Pass = 1 - { Er(i) }/n (4) 

where 'E^ denotes the sum of i = 1-n, and r(i) is expressed 
by Eq.5 below: 

r(i) = [ [Tii-Toi] / (T„/a ) (Mii- Mqi) / (Mu + Moi) /2 ] 

( + l2) (5) 

where ' Toi ' denotes a temperature of 'n' featuring points Pqi 
of a reference genome, and ^ Tii ' denotes a temperature of 'n' 
featuring points P,, of a specimen organism. 'Moi' denotes the 
mobility of ^n' featuring points Poi of a reference genome, and 
□ 'Mxi' denotes themobilityof'n' featuringpoints P^, of a specimen 

organism. 'T„' denotes the temperature normalization factor 
which corresponds to a temperature range (e.g., about 60°C) 
which gives featuring points. ' a ' denotes a relative value of 
J^' both 'weights' of mobility change and temperature change, and 

is normally 1. Denominator part ( + 1^)1/2 i^Eq.S constitutes 
the second normalization factor (so that an r(i) value can be 
1 or less) . 

Using Pass which is obtained based on Eq.l or 4, genome 
semi-distance ds is obtained which is defined by Eg. 3 below: 
ds = (1 - PaSS)/PaSS (3) 

In general , in case the species of a specimen microorganism 
is the same as that of a reference genome, a PaSS value (score) 
becomes nearly 1, and a genome semi-distance value becomes 
nearly 0 . 



(5) Identification of microorganism based on PaSS and/or genome 
semi-distance 

In case a species of amicroorganismof interest is unknown , 
Pass, and if necessary, genome semi-distance ds is/are calculated 
for each reference genome. Then, comparison with other reference 
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gencne is repeated until a .aSS value goes eve. a standard value 
,e g 0.96, or a genome semidistance d. value goes below a 
standlrd value (e.g., 0.04, . In this way , a species of a speoimen 
.icrocrganism can be Identified based on a PaSS value or a genome 

semi-distance value. 

on the other hand, in case the species of a microorganrsn, 
which is to be identified is .nown, similarity between 
individuals can be determined which belong to the same spec.es 

• ^^c^a-nn(^ d value. Namely, PaSS 
based on a PaSS or genome semi-distance a, vax 

• Hnc;tance d is determined for two specimen 
or genome semi-dxstance Us J-a 

•c:rr,. which belong to the same species, and in case 
microorganisms whicn oexuny 

J w n^ar or two genome semi-distance 

two Pass values are adequately near or iiwu y 

, d. values are adeguately near, it is determined that the two 

specimen microorganisms belong to the same spec.es and 
similarity between the individuals is hrgh. 

in addition, it is preferable to store by plotting a set 
of species or individuals which exist within a specific genome 
semi-distance in the genome seguence space as neighborrng 
information. 

in this way , it Is preferable to resister genome protrlmg 
images, featuring points. Judgment information, neighboring 

^ /^a-h a ■h;:^ and const itute 
information, and so on as an appropriate database , a 

them so as to be available at any time. 

AS described above, a melting Initiation point whrch 
corresponds to the initiation of melting before denaturation , 
a minimal mobility point where the mobility reaches the minimum 
before strand dissociation, and a mobility end point where a 
DNA first reaches the mobility in a single-stranded state after 
denaturation are extracted as featuring points from a genome 
profiling imag^. « a result, by a guantitative method usrng 
the Obtained featuring points, a species of a microorganrsm 
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and similarity between individuals which are classified into 
the same species of microorganism can be accurately and simply 
identified . 

A method for identifying a species of and a method for 
identifying similarity of a microorganism by its genotype are 
described above. A method according to the present invention 
is not limited to a microorganism, but is also applicable to 
methods for identification such as a method for identifying 
a species of and a method for identifying similarity of a general 
organism . 

Examples 

The present invention will be described in more detail 
by examples below. 

Example 1 

Genome profiling (GP) according to the present invention 
of strains of yeasts, Escherichia coll, and Bacillus suhtllls , 
and mutants therefrom (twenty kinds as a whole) was carried 
out as described below, and their spiddos were determined with 
a computer , and were registered as a data base . Bacillus suhtllls 
was newly isolated from commercially available natto , and DNA 
was extracted, and GP was carried out to give spiddos, which 
were assumed as spiddos of a tentative unknown specimen X. 

Preparation of double-stranded DNA fragment by random PGR 

One hundred Ail of reaction mixture which contains 200 
fi M dNTP (N = G, A, T , C) , 0 . 5 a^M primer {pfM12 (dAGAACGCGCCTG) ) , 
10 mM Tris-HCl (pH 9.0) , 50 mM KCl , 2 . 5 mM MgCl2, 0.1% Triton 

X-100, 0.02 unit/ Ail Taq DNA polymerase, and an appropriate 
amount of template DNA {3 ii 1 of a DNA solution was added, and 
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the volume of the reaction mixture was adjusted to 100 ijl 1) was 
prepared for each DNA, Double-stranded DNA fragments were 
prepared by PGR comprising the steps of 1) the treatment at 
94°C for 1 min, 2) 20-30 times repeating the cycle comprising 
denaturation at 94°C for 30 min, annealing at 28°C for 2 min, 
and elongation at 47 °C for 2 min, followed by 3) the treatment 
at 47°C for 2 min. 



TGGE of double-stranded DNA fragment 

To the double-stranded DNA fragment which was obtained 

Q as described above, 0.3 jiq of DNA which has SEQ ID No . 1 was 

added as standard DNA, and TGGE was carried out. Condition for 

Co 

..p TGGE is as follows: 4% polyacrylamide gel (40 mM Tris buffer 

(pH 8.0) which contains 8M urea) , linear temperature gradient 
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f rom 30 to 70 C , 400V, 110 min, Taitec electrophoresis apparatus 
TG-180 . 



=^ Extraction of identification dots of each DNA fragment from 

in 



electrophoresis pattern 

A band (electrophoresis pattern) of DNA (including 
standard DNA) on an electrophoresis gel which was obtained by 
TGGE was silver stained as described below, and identification 
dots were extracted from each visualized pattern. 

1) A gel which is attached to a gel bond film is directly 
transferred into a plastic container which contains 200 ml of 
a 30% PEG 2000 aqueous solution, and stirred at room temperature 
(15-30^*0) for 30 min. Meanwhile, the gel is detached from the 
film, contracted, and becomes white and translucent. 

2) The solution is once substituted for 150 ml of distilled 
water, and the gel is adequately rinsed. This procedure is 
repeated again. Rinsed water is absolutely sucked with an 
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aspiratpr. a silver 

3, After the r.nsed water 

3tain solution (which was P"^"-^ J „^ ..aistiUea 

and 2 .1 of 25% a^onia aqueous solution t ^^^^^^^^ 
„ater. .ixin,, followed by dissolv.n, 0 . 4 . 

J o-t- ■> T-T-pd for 30 min. 
is added, and stirred ro v.n distilled water , 

4, The liquid is substituted for ^« " ,,,3 procedure 

and stirred for 1 min , and the water rs d.scarde 

is repeated again. 4- 9 nn ml of a developer 

3, Xhe liquid is substituted for abo t » ^ ^^^^ 

,„hich was prepared b. addinq -° " ° ; ,^,,,„., 

,„„ „1 °^-7^ :;;J;;3,a.en until a band is appropriately 
by dissolving) , and genxiy 

stained. ined the developer is 

S) When the band is ^^P'°"'"^"''„^.p,,pared stop solution 
qui=Klyren,oved.and200,„lo£ap ,3 added, and stirred 

,10» acetic aoid + 40% methanol, in water) 

„ ■ to qive a stained gel specimen. 

for about 10 mm to q.ve ^^^^ ^^^^ ^^^^^^^ 

^"r": bor-mentioned method, and spiddos were 
was visualized by the ac ^^^^cted as follows: 

determined. Identification dots were e. rac ^^^^ 

. qel Picture of .P was ^^"-^ 

.nd the obtained image was corrected a 
a 3canner, and he o ^^^^^^^^^ ^^^^ ,,,„cted by a 

with a computer, and i ^ ^ 

method how featurinq points were specified 

display with a mouse^ ^ 

.ne spiddos Which were o ^^^^^^^ 

3pecimen ""^--t ^....ated by the total combination 
a data base, and Pabb wa^. 

calculation method. 

• V, rT^^rps the highest fabo 
A genome which gives 
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automatically extracted from 20 kinds of genomes which were 
previously registered . 

Results were obtained as a computer output, and 
accompanying information revealed that the organism which X 
showed the highest similarity is Bacillus subtilis . 

Example 2 

Random PGR was carried out under the same condition except 
using DNA of Bacillus subtilis as a template, and cy3-pfM12 
(5' cy3-AGA ACG CGC CTG 3') which contains phosphor cy3 or 
FITC/UCS (%' FITC- CA GCA AAC AGC TAT GAG 3') which contains 
phosphor FITC as a primer for each primer, and the obtained 
double-stranded amplified DNA fragments were mixed, and TGGE 
was carried out in a way which is similar to Example 1. An 
electrophoresis pattern by TGGE was visualized with an 
excitation/fluorescent wavelength of 550 nm/570 nm (phosphor 
cy3) or 494 nm/519 nm (phosphor FITC) . Fig. 6a illustrates an 
electrophoresis pattern in the case where cy3-pfM12 (5 ' cy3-AGA 
ACG CGC CTG 3') was used as a primer, and Fig. 6b illustrates 
an electrophoresis pattern in the case where FITC/UCS (5 ' FITC- 
CA GCA AAC AGC TAT GAC 3') was used as a primer. Identification 
dots were extracted from each pattern. The microorganism which 
was used as a template was identified as Bacillus subtilis based 
on the obtained identification dots. 

The present disclosure relates to the subject matter 
contained in Japanese Patent Application No . 2000-123755, filed 
on April 25, 2000, which is expressly incorporated herein by 
reference in its entirely. 

SEQUENCE LISTING 
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